Search CORE

8 research outputs found

Joint Entity Extraction and Assertion Detection for Clinical Text

Author: Bhatia Parminder
Celikkaya Busra
Khalilia Mohammed
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2019
Field of study

Negative medical findings are prevalent in clinical reports, yet discriminating them from positive findings remains a challenging task for information extraction. Most of the existing systems treat this task as a pipeline of two separate tasks, i.e., named entity recognition (NER) and rule-based negation detection. We consider this as a multi-task problem and present a novel end-to-end neural model to jointly extract entities and negations. We extend a standard hierarchical encoder-decoder NER model and first adopt a shared encoder followed by separate decoders for the two tasks. This architecture performs considerably better than the previous rule-based and machine learning-based systems. To overcome the problem of increased parameter size especially for low-resource settings, we propose the Conditional Softmax Shared Decoder architecture which achieves state-of-art results for NER and negation detection on the 2010 i2b2/VA challenge dataset and a proprietary de-identified clinical dataset.Comment: Accepted at the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019

arXiv.org e-Print Archive

Crossref

LATTE: Latent Type Modeling for Biomedical Entity Linking

Author: Bhatia Parminder
Celikkaya Busra
Reddy Chandan K.
Zhu Ming
Publication venue
Publication date: 20/01/2020
Field of study

Entity linking is the task of linking mentions of named entities in natural language text, to entities in a curated knowledge-base. This is of significant importance in the biomedical domain, where it could be used to semantically annotate a large volume of clinical records and biomedical literature, to standardized concepts described in an ontology such as Unified Medical Language System (UMLS). We observe that with precise type information, entity disambiguation becomes a straightforward task. However, fine-grained type information is usually not available in biomedical domain. Thus, we propose LATTE, a LATent Type Entity Linking model, that improves entity linking by modeling the latent fine-grained type information about mentions and entities. Unlike previous methods that perform entity linking directly between the mentions and the entities, LATTE jointly does entity disambiguation, and latent fine-grained type learning, without direct supervision. We evaluate our model on two biomedical datasets: MedMentions, a large scale public dataset annotated with UMLS concepts, and a de-identified corpus of dictated doctor's notes that has been annotated with ICD concepts. Extensive experimental evaluation shows our model achieves significant performance improvements over several state-of-the-art techniques.Comment: AAAI 2020 Conferenc

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Multivariate Continuous-Time Models: Approximate Inference Algorithms and Medical Informatics Applications

Author: Celikkaya Emine Busra
Publication venue: eScholarship, University of California
Publication date: 01/01/2016
Field of study

Temporal modeling of real-life systems, such as social networks, financial markets and medical decision-support systems, is important to understand them better, and make predictions. Temporal data of these systems have irregular time granularity; therefore continuous-time models are a natural fit. In this thesis, we focus on one of the building blocks in this process, statistical inference. Additionally, we apply continuous-time models to a medical informatics application. In our inference algorithms, we focus on continuous-time Markov processes (CTMPs). Answering queries about the components as a CTMP evolves involves inferring the probability distributions of the state of the system at query time points. When the number of components is large, exact inference becomes intractable since the state space is exponentially large in the number of components. Structured representations provide a framework to apply inference methods in an efficient way. Such representations usually also discretize time. However, choosing the right time- width is challenging since the observations are not synchronized among components and there might be large intervals without any observations. Therefore, our inference algorithms use the structured representation of continuous-time Bayesian networks (CTBNs), but they can also be applied to other continuous-time representations. A CTBN provides a compact representation using local dependencies. Unfortunately, exploiting the structure in the dynamics does not alleviate the need to represent the full joint space, and exact inference in CTBNs is intractable. Our approximate inference computations concentrate on the key calculation for a CTMP, the matrix exponential. We use two different expansion of the matrix exponential to derive different approximation algorithms. Our first algorithm keeps the solution in the factorized state space by using uniformization. It is the first non-sampling method to have bounded error. Also, it has better experimental results than the previous methods. Our second algorithm is built upon the sum of time-ordered products. It combines the advantages of deterministic and sampling methods as it is deterministic and anytime. It converges to the true distribution in the limit of infinite computation time, and it is not random. Random methods such as sampling methods can lead to instability when used inside parameter estimation algorithms. We show that it performs as well as or better than the current best sampling approaches on benchmark problems. Our last work is an application of a multivariate Gaussian process (MGP) to a medical informatics problem. We estimate the blood gas values of a patient during mechanical ventilation in a pediatric intensive care unit. Frequent blood gas values allow more responsive care which can reduce the duration of ventilation and risk of lung injury. Estimating these values from non-invasive measurements can reduce the number of invasive blood tests, which are challenging in children. We estimate them by using previous values of all variables, and current values of all non-invasive variables. We develop an MGP model because the variables are naturally continuous. Our results show promising prediction accuracies, which could be used to automate the ventilation process

Ezid

eScholarship - University of California